Statistical properties of multistep enzyme-mediated reactions 
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Enzyme-mediated reactions may proceed through multiple intermediate conformational states 
before creating a final product molecule, and one often wishes to identify such intermediate struc- 
tures from observations of the product creation. In this paper, we address this problem by solving 
the chemical master equations for various enzymatic reactions. We devise a perturbation theory 
analogous to that used in quantum mechanics that allows us to determine the first ((n)) and the 
second (a 2 ) cumulants of the distribution of created product molecules as a function of the sub- 
strate concentration and the kinetic rates of the intermediate processes. The mean product flux 
V — d{n)/dt (or "dose-response" curve) and the Fano factor F = a 2 /{n) are both realistically 
measurable quantities, and while the mean flux can often appear the same for different reaction 
types, the Fano factor can be quite different. This suggests both qualitative and quantitative ways 
to discriminate between different reaction schemes, and we explore this possibility in the context of 
four sample multistep enzymatic reactions. We argue that measuring both the mean flux and the 
Fano factor can not only discriminate between reaction types, but can also provide some detailed 
information about the internal, unobserved kinetic rates, and this can be done without measuring 
single-molecule transition events. 



Enzyme-mediated reactions are ubiquitous in biology. 
Traditionally, they have been described as a two-step 
Michaclis-Menten (MM) process pQ, in which the enzyme 
and the substrate form a complex that can decay either 
back into the enzyme and the substrate, or forward into 
the enzyme and the product (see Fig. [T|V). The latter 
step is usually assumed to be irreversible, leaving three 
kinetic rates that specify the reaction. To determine 
these kinetic rates, a typical experiment measures the 
average rate of product formation (or product "flux" ) V 
as a function of substrate concentration S (also called a 
"dose-response" curve), producing a plot as in Fig. |2jV. 
Two pieces of information can be extracted from this 
plot: the saturating reaction rate Vmax and the Michaelis 
constant K, the substrate concentration at half of the 
maximum rate. Importantly, these two measurements 
do not specify the three underlying kinetic rates, thus 
they do not allow for a full identification of the reaction 
processes. 

The MM mechanism is not entirely general: many 
enzyme-mediated reactions consist of multiple interme- 
diate internal steps (such as conformational changes of 
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either the enzyme or the substrate, enzymes that occur 
in active and inactive states, etc.), each with its own for- 
ward and backward reaction rates. While measurements 
of substrate-enzyme complex formation and product re- 
leases are possible even on a single molecule level in enzy- 
matic kinetics [5] and in ion channel transport [3J U] , typ- 
ical experiments cannot resolve intermediate steps when 
measuring only the average reaction rate since they pro- 
duce qualitatively similar curves for V(S) . For example, 
the mean flux through an arbitrary complex ion chan- 
nel that holds at most one large transported molecule at 
a time is indistinguishable from that through a simple 
channel with just two internal states [5]. 

An interesting problem then is to determine which ex- 
perimental measurements could identify the multistep 
nature of an enzyme-mediated reaction without requir- 
ing measurements at intermediate steps. We suggest that 
this is possible by measuring not only the mean rate but 
also the variance in the rate of the creation of product 
molecules. Modern experiments can clearly perform this 
task in different experimental systems [21 IS]- 

Here we present a general perturbative approach for 
calculating the cumulants of a product molecule flux 
for a given enzymatic reaction scheme. To illustrate 
the method, we first apply it to the usual MM reaction 
(Fig. [IX). In addition to recovering the well-known re- 



2 



A 



B 



C 



D 



o 



cp a 



FIG. 1: Potential schemes for an enzyme-mediated reaction, 
in which substrate S is converted to product P. A: A simple 
Michaelis-Menten (MM) reaction. B: A MM reaction with 
an additional intermediate state (e.g. if the complex under- 
goes a conformational change before creating the product). 
C: A scheme in which the enzyme must become active (e.g., 
through phosphorylation) before mediating the reaction. D: 
A scheme in which the enzyme must become active before 
mediating the reaction, and the reaction leaves the enzyme 
inactive. 



suit for the mean rate of product formation as a function 
of substrate concentration, we derive the dependence on 
substrate of the Fano factor, the ratio of the variance in 
the number of product molecules to the mean. Impor- 
tantly, our approach is extendible, at least in principle, to 
an arbitrary enzyme-mediated reaction scheme, and we 
demonstrate this by analyzing three more complex reac- 
tion schemes, shown in Fig.[T]3-D. In the context of these 
reactions, we show that the dependence of the Fano fac- 
tor on the substrate concentration can produce qualita- 
tively different results for different reaction types, allow- 
ing one to distinguish them experimentally. In addition, 
we argue that quantitative features of the Fano factor 
measurements can constrain the values of the underly- 
ing kinetic rate constants more tightly than the mean 
rate measurements alone. Measurements of higher order 
product formation cumulants, if experimentally possible, 
would allow one to constrain properties of the reaction 
even more strongly. 



enzyme acts independently, that is, the substrate concen- 
tration is much larger than the enzyme concentration. 
This is equivalent to treating the process as if only one 
enzyme were present at a time. Furthermore, we assume 
that the concentration of the substrate is constant dur- 
ing each experimental measurement, and thus our master 
equation needs only to keep track of the enzyme's state 
and the number of created product molecules n. We note 
that both of these assumptions can be relaxed using re- 
cently developed techniques [8 , 9 . Finally, we only search 
for the distribution of the number of product molecules 
at times much longer than a typical enzymatic turnover 
time. 

We begin by demonstrating our method on the simple 
Michaelis-Menten (MM) reaction in Fig. [IJV. In the MM 
reaction, the enzyme will be in cither a free state Bora 
bound state ES. Therefore we partition the joint prob- 
ability distribution into two parts: P E , the probability 
that n product molecules have been created and the en- 
zyme is free, and P ES , the probability that n product 
molecules have been created and the enzyme is bound, 
yielding the CME [7] 



dP E 
dt 

dP° S 

dt 
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ES 
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where the rates are defined in Fig. [TJV, and S is the num- 
ber of substrate molecules. (Note that S can equivalently 
be thought of as the concentration of substrate as long as 
one appropriately rescales the rates) . The total probabil- 
ity of having n product molecules is then P n — P E +P ES . 

We note that the situation where the product 
molecules arc created and never destroyed or transformed 
back into the substrate is not physical, and additional re- 
actions that degrade the product in some way are needed. 
However, as long as we are interested in how many prod- 
uct molecules have been created, rather than are present 
at a given time, the creation, Eqn. ([I] [2]), and the decay 
reactions can be considered independently. 

Similar to Refs. M EU CCD D2] and others, we begin 
our solution of Eqns. (l][2) by defining the generating 
function 



METHODS: THE MICHAELIS-MENTEN MODEL 

Going beyond a simple description of the mean pro- 
duction of a particular molecule and making predictions 
about the intrinsic noise requires a stochastic description, 
such as the chemical master equation (CME) [7]. The 
CME describes the evolution in time of the joint prob- 
ability distribution for the copy numbers of all species 
involved in a reaction scheme. For the enzyme-mediated 
reactions we consider, we make the assumption that each 



G z (x) 



P* 



(3) 



with z e {E,ES}. Defining the vector \G) — 
(G E , G ES ) T , we may write the total generating function 
as 



G( X ) = (l\G) = G E + G 



ES 



(4) 



where (1| = (1,1) (note that we are adopting "bra-ket" 
vector notation commonly used in quantum mechanics 
literature) . The advantage of this formalism is that the 
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mean (n) and variance a 2 of the distribution of product 
molecules P n can be calculated from G(x) via 



d(\nG) 

d(i X ) 



o 2 = 



d 2 (lnG) 
d(i\) 2 



(5) 



Furthermore we note that having N (independently act- 
ing) enzymes is equivalent to taking G to G , so that 
extension to larger concentrations of enzymes is straight- 
forward. 

Now multiplying Eqns. ( l|2l by e lxn and summing over 
n produces 



d\G) = 
dt 

where, for the MM reaction, 



H\G), 



H = Ha = 



-faS k^ + k 2 e lx 
IdS -{k-i + k 2 ) / 



Ecm. ^ is solved by 



\G(t)) = e Ht \G ), 



(6) 



(7) 



(8) 



with an initial condition \Gq). If we write the matrix ex- 
ponential in terms of the eigenvalues Xj and eigenvectors 



\uj) of H as 



3 



(9) 



then, at t much larger than the typical enzyme turnover 
time, G(x) becomes 



G( X ) = Yy* t {l\u j ){u j \G ) 



„Ant, 
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where Ao is the eigenvalue with the least negative real 
part. Taking the log, we get 

lnG(x) = A < + ln((i| Mo )( Uo |G )) « A i, (11) 

since again, in the long-time limit, the first term dom- 
inates the second (for any bounded Go), and the ini- 
tial number of product molecules is forgotten. Recalling 
Eqn. ([5]) , it is clear now that one only needs to find the 
X-dependence of the least negative eigenvalue A of the 
matrix Ha in order to compute the cumulants of the 
product molecule distribution. In fact, writing Ao as a 
power series, 



Ao 



\- ,(m) (ixT 
^ ml 
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(12) 



it is clear that one only needs to know the coefficients up 
to m = 2 in order to compute the mean and variance of 
the distribution; i.e. 



n) = A 
2 



b i 



G 



x^h, 



(13) 
(14) 



and higher order terms are needed for higher cumulants 
only. Since Eqn. (|5| takes \ — > 0, this permits a per- 
turbative approach similar to that used in quantum me- 
chanics |13j . with x treated as a small parameter. 

Specifically, we write H = H^+H^ E" =1 fo) m /m! 
where (for the MM case) 



M 0) = 



—kiS k-i 
kiS —{k-i 




H 



(i) 




(15) 
(16) 



and we truncate at m — 2. We emphasize that this trun- 
cation does not introduce any further approximation if 
one is interested only in the first and second moments 
of the product molecule distribution. The least negative 



eigenvalue of H^ is A 
corrections are given by 
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Noting Eqns. ( 13p4 1, the rate of product formation 
V = d(n)/dt and the Fano factor F = <r 2 /(n) can now 
be written: 



V = X 



(i) 
> 

v( 2 )M(l) 



F = A ^/A 
For the MM case (Fig.^), this gives 

V A 

F a = 1 



V-max 
A 



S + K A 

s 

OLA 



(S + K A ) 2 '- 



(19) 
(20) 



(21) 
(22) 



where V™ ax = fc 2 , K A = (fc 2 +fc_i)/fei, and ola = 2fc 2 /fci. 
The expression for mean flux Va is well-known [1], and 
Ka is called the Michaelis constant; the expression for 
the Fano factor Fa is less familiar. 

This procedure is fully extendible to other more 
complicated enzyme-mediated reactions. The reaction 
scheme determines the master equation and thus H^ 
and H^\ Specifically, H^ is given by the Markov tran- 
sition matrix for the enzymatic states (disregarding the 
n variable), and H^ has a 1 marking every rate where 
the product gets created, and a —1 where it is destroyed. 
Then Eqns. ( 19|20 ) give the product formation rate and 
the Fano factor, and higher orders in perturbation theory 
would provide more cumulants. To illustrate the breadth 
of the method, in the next section, we apply this pro- 
cedure to three reaction schemes that include multiple 
intermediate reaction steps. 
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RESULTS: COMPLEX ENZYMATIC REACTIONS 
Product distribution statistics 

Many enzyme-mediated reactions involve intermediate 
steps, and it is instructive to illustrate our approach with 
three prototypical examples, shown in Fig. [TJ3-D. 

Reaction scheme B 

Fig. [TJ3 depicts a case in which the complex undergoes 
an intermediate step, such as a conformational change, 
before creating the product [J3]. This kinetic scheme is 
also equivalent to certain ion channels [5 . Such multistcp 
enzymatic reactions have been shown (including via our 
method here) to reduce noise in chemical reactions [15] . 
The master equation describing this system is 



dPl . 
dt 

dt 

dt 

which yields 



-kiSpZ + k^P^ + kzP^, (23) 
hSP,f - + k + )P* s + k_pz p , (24) 
fc+P,f s - + fc 2 )P r f p , (25) 



H 



Co) 




H 



(i) 



= fc 2 




\0 Oj 

The product flux and Fano factor are then 

Q 

VB = 



(26) 



(27) 



F B = 1 



S + K B 
5(5 + if B ) 
(5 + X s ) 2 



as 



(28) 
(29) 



where Vg"* = k 2 k+/(k 2 + k+ + fe_), if B = (/c 2 fc + + 
fe 2 fe_i + k-xk-)/{k x {k-2 + k + + fc_)), a s = 2k 2 k+/(k 2 + 
k + + k^) 2 , and i<T B = (fc 2 + fc+ + fc_ + k-i)/h. 



channel that can only transmit a single molecule at a 
time [16] . Alternatively, this scheme could be a model 
for the production-degradation and subsequent transla- 
tion of mRNA (E*) by ribosomes (5) into protein (P). 
Finally, this is also an extreme model of an enzyme that 
has internal states with different rates of product forma- 
tion, such as studied in [2]. For this scheme we can write 
the following master equation: 



dPl 

dt 
dP E * 
dt 



— k+Pn 



k+pf - k-pf + k 2 p*ii 



dP, 



E*S 



dt 

which yields 
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-(fc-l + fc 2 ). 



The product flux and Fano factor are then 

5 



Pc 



TV 



c 5 + A- c ' 
1 — ac 



(30) 

(31) 
(32) 

(33) 



(34) 
(35) 

(36) 
(37) 



where V™ ax = k 



2, ifc = (fe++^-)(fc2+fc-l)/(fe+fci), and 
a c = 2Aj2[l + fe-(fe+-*2-fc-i)/fe+]Ai- Note that these 
expressions reduce to those for the MM reaction (Eqns. 
( 2l|22 1) for fe_ — > 0, since this limit corresponds to the 
enzyme always being in the active state. Note also that 
since ac can be negative, Fc can be greater than 1 (and 
in fact it is infinite in the limit of rare activation k + — > 
0) due to the compounded noise from the independent 
stochastic processes of enzyme activation and complex 
formation. Under the interpretation of this scheme as 
protein translation, F» 1 corresponds to many proteins 
in a translation burst from a single rare mRNA. 



Reaction scheme C 

Fig. [Tp depicts a case in which the enzyme exists in 
an inactive and an active state. The enzyme switches au- 
tonomously between these states, but can only react with 
the substrate in its active form. Note that in this case we 
have two isolated reactions, since the enzyme remains in 
the active state when a product is produced. This scheme 
can be interpreted as a toy model for a voltage-gated ion 



Reaction scheme D 

Figure [Tp shows a third example of a more complex 
reaction scheme, in which an active enzyme transforms 
a substrate into a product and, in contrast to scheme C, 
returns to its inactive state in the process. The enzyme 
must switch back to its active state for a new reaction 
to occur. Similar dynamics have been found for the (3- 
galactosidase enzyme 0. Alternatively, this can be a 
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model for an enzyme that transfers a phosphate group 
to a substrate, and needs to reacquire a new phosphate 
group before continuing to function as an enzyme. For 
this scheme, we can write the following master equation: 



dP. 



E 



dt 
dt 
dt 

which yields 



k+P* 



kn P E " S 



faSP 



E* 



k-iP 



E*S 
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(38) 
(39) 
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H 



The product flux and the Fano factor are then 

v D = ^ nax s 

F D = 1 — OtD 



(41) 



(42) 



S(S + K' D ) 



(43) 
(44) 



where Vg 13 -* 
*-i)/(fci(fca4 



= k 2 k + /(k 2 + fc+), K D = k + (k 2 + 
fc 4 



2k 2 k + /(k 2 + k + ) 2 and 



(/c2 + fc+ + fc-i)/fci- Note that these expressions reduce to 



those for the MM reaction (Eqns. ( |21|22[ )) for k + -> oo, 
since this limit corresponds to the immediate reversion 
of the enzyme to its active state following a product for- 
mation. 

All four reactions in Fig. [T] use an enzyme to convert a 
substrate into a product, but as we have derived using the 
present method, the statistical properties of the product 
molecule distributions differ among the cases. 



|9j[37j|44])], however, can reveal qualitative and quantita- 
tive features that can differentiate among these schemes, 
which we outline here and summarize in Table Q] 

First, a distinction is possible based on the asymptotic 
value of F as the substrate concentration S saturates. 
For reaction schemes A and C, 



JU, C (S -> oo) = 1, 



whereas for reaction schemes B and D. 



Fb,d(S — > oo) = 1 - a B , d, 



(45) 



(46) 



where as and ajj are defined following Eqns. (29 I and 



( 44 1 respectively. This expression has a minimum value 
1/2 in the limits k 2 = k + ^> fc_ for B and k 2 = k + for 
D. Thus a saturation value of F that is significantly less 
than 1 offers evidence for reaction scheme B or D over A 
or C (see Fig. [2). 

Second, distinctions are possible based on the value 
F* at the extremum of the Fano factor as a function of 
substrate concentration S. For a MM reaction (case A), 
there is a minimum: 



1 



_OA_ 

4K A 



1 



1 



2 k 2 + k-i 



(47) 



which is always between 1/2 (for k 2 ^> fc_i) and 1 (for 
k-i 3> k 2 ). Similarly, for reaction scheme C, we obtain 



F c = l 



(48) 



where ac and Kq are defined following Eqn. (37 1. This 



expression also has a minimum value of 1/2 (for k + 3> 
fc_ and k 2 ^> fc_i), but, unlike in the MM case, it can 
become larger than 1 if k + (k + + fc_) < fc_(fc 2 + fc-i) 
(see Fig. Indeed, as mentioned, in the limit of rare 
activation k + — > 0, we find F* — > oo. 

Depending on the kinetic rates, reaction schemes B and 
D may or may not have a minimum for positive S (see 
Fig. [2] for an example of each). In the cases for which a 
minimum exists, 



Measurable differences between reaction schemes 

Since different reactions have different statistical prop- 
erties, it should be possible to use our methods and 
results to differentiate among the underlying reactions 
based on experimental observations. Here we demon- 
strate how basic measurements can differentiate among 
the four reaction schemes presented above. 

The mean product formation rates V for all four reac- 
tion schemes A, B, C and D shown in Fig. [I] Eqns. (21 



28 36 43 1 , are qualitatively similar functions of sub- 



strate concentration S, and it would not be possible to 
differentiate the schemes based on mean data alone (see 
Fig. pi. Measurement of the Fano factor F [Eqns. (22 



ub,d 



K' 



B.D 



B,D 



4 Kg d^K'i 



B.D 



K 



B,D) 



(49) 



where as, Kb, and K' B are defined following Eqn. (29 1 
and an, Kd, and K' D are defined following Eqn. (44 1. 
This expression has the minimum value 1/3 in the limit 
k + = k 2 ^> k-i for both schemes (and additionally k + ^S> 
fc_ for B). In the reaction scheme B, these limits reduce 
the system to a linear irreversible three-step cascade; an 
L-step irreversible cascade has minimum F* of 1 / L in the 
analogous limits |15j . Comparing with the MM minimum 
value of F* — 1/2, it is clear that a measured value of 
F* less than 1/2 is a strong indication that more than 
one intermediate step is present [2"D] . 
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TABLE I: Bounds on experimentally measurable quantities 
that are useful in distinguishing among schemes for enzyme- 
mediated reactions. A, B, C, and D refer to reaction schemes 
in Fig. [I] Star (*) denotes the extremum of the Fano fac- 
tor, such that F* is the minimum or maximum value and S* 
is the substrate concentration at which it occurs. K is the 
substrate concentration at which product formation rate V is 
half-maximal. Generally speaking, minimum bounds on all 
three quantities occur when forward reaction rates dominate 
backward rates, and maximum bounds occur when backward 
rates dominate forward rates; see text for more details. 



Lastly, distinctions can be made based on measure- 
ment of S* , the substrate concentration at which an ex- 
tremum in F occurs. For cases A and C, 



5* 
A,C 



= 1, 



(50) 



where K a and Kc are defined following Eqns. (22) and 



(37 1 respectively, and, as in all four cases, K is the con- 
centration at which V is half-maximal. For cases B and 
D, on the other hand (when there is a minimum), 



S 



B,D 



K> 



B,D 



K 



B.D 



K' BD - 2K B , D ' 



(51) 



where Kb and K' B are defined following Eqn. (29 1 and 
Kd and K' D are defined following Eqn. (44 1. This ex- 
pression is bounded from below by 1 (e.g. for k + 3> 
{fc_,fc2,fc_r} for B, or for k + ^>> {fc2,fc_i} for D), but 
can potentially be infinite (e.g. for fe_ = k-i ^> {fc2, &+} 
for B, or for Li » = fc| for D). This implies that 
if an extremum of the Fano factor occurs at a substrate 
concentration significantly different from that at which 
the mean product formation rate is half-maximal, it is a 
strong indication that more than one intermediate step 
is present. 

Table [I] summarizes these distinctions, and Figure [2] 
showcases the qualitative differences in the Fano factor 
curves among the four reaction schemes caused by dif- 
ferences in the underlying kinetics. For more compli- 
cated reaction schemes, such as multiple substrate bind- 
ing by the enzyme, modeled by a high Hill coefficient, 
the Fano factor curve would gain even more distinguish- 
ing features, such as additional extrema and/or inflection 
points [2Tj . 



Extracting reaction rates from data 

In addition to helping one distinguish among compet- 
ing reaction schemes, experimental measurement of the 
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FIG. 2: Mean product flux (also called dose-response curve) 
V and the Fano factor F versus substrate concentration S 
for the four cases in Fig. [T] A solid, B dashed, C dotted, 

for 
= 1, 



and D dot-dashed. Plots are of Eqns. (21 
V and Eqns. |22] [29] |37j [44| for F, with fci 



28 36 



431 



1, k 



k2 = 1, fc+ = 0.1, and fc_ = 0.01. Note that while there are 
no qualitative differences in V (and in fact all curves collapse 
when V is normalized by , (/ max and S by K, as seen in the 
inset), features can appear in F that signify that a process is 
more complicated than the single-intermediate case A. 



dose-response curve V(S) and the Fano curve F(S) can 
be used to determine the kinetic rates of the underlying 
biochemical reactions. If the structure of the biochemical 
reaction is known, analytical expressions for both curves 
in terms of the kinetic rates and the substrate concen- 
tration can be obtained using our method [see e.g. Eqns. 
( pTp2] [28f29| [36p7l [43J44)] and can be fit to experimen- 
tal data. Often times, measurements of the qualitative 
features of both curves (such as those highlighted in Ta- 
ble [i} are enough to extract the kinetic rates; for more 
complex reactions a full fit to the data would be neces- 
sary. Additionally we note that performing full fits of 
experimental data to the analytical expressions may also 
help in the original task of distinguishing among (or at 
least eliminating) different biochemical reaction schemes. 

The MM reaction is an example of a case in which mea- 
surement of the qualitative features is enough to extract 
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all kinetic rates. However, it is important to note that in 
order to do this, one needs both the dose-response curve 
and the Fano curve. In particular, one needs only to mea- 
sure the reaction rate at saturation V r jj nax , the substrate 
concentration Ka at which the rate is half maximal, and 
the minimum value of the Fano curve F* A . Then, from 



Eqn. (47) and the expressions following Eqn. (22 1, one 
obtains 



fc 2 



irmax 
V A i 

F A - 1 / 2 T/ ma: 

1-F% Va 

T/max 

V A 

2K A (1-F%Y 



(52) 
(53) 

(54) 



Instead of obtaining only fc 2 and a combination of k\ 
and by measuring only the dose-response curve (as 
is traditionally done for MM reactions), we now have 
analytical expressions for all three rates. 

For more complex reaction schemes, a similar anal- 
ysis can be performed to obtain analytical expressions 
for the kinetic rates in terms of the experimental data. 
However, it can be the case that not all rates can be de- 
termined unambiguously from measurements of V and 
F (for the reaction scheme B, for example, symmetries 
in the inverted expressions imply that measurements of 
V and F do not always uniquely determine the five un- 
known kinetic rates). When experimentally feasible, one 
may also compare higher moments of the measured prod- 
uct molecule distributions with those calculated via our 
method. 



DISCUSSION 

We have developed a method of solving chemical mas- 
ter equations for multistep enzymatic reactions using a 
perturbation theory approach analogous to that encoun- 
tered in quantum mechanics. With this method, find- 
ing cumulants of the distribution of product molecules 
is equivalent to diagonalizing a matrix with dimensional- 
ity equal to the number of internal states in the kinetic 
diagram of the reaction. Then obtaining the first m cu- 
mulants of the reaction can be done by solving the per- 
turbation theory to mth order, which is straightforward. 
In particular, the first two moments (n) and <r 2 together 
define the dose-response curve V — d(n) / dt and the Fano 
factor F = a 2 / (n) . As both are currently measurable in 
a variety of systems, comparing the calculated F to ex- 
perimental data can be used to identify the underlying 
structure of molecular reactions. 

We have applied this perturbation theory approach to 
four different reaction schemes, starting with the sim- 
plest Michaelis-Menten kinetics, and progressing to more 
complicated kinetic schemes with internal states. We cal- 
culated the dose-response curve and the Fano factor for 



each as functions of the substrate concentration. Im- 
portantly, while the dose-response curves for all of the 
considered reactions are qualitatively similar, prominent 
qualitative features of the Fano factor curve (such as its 
values at large substrate concentrations, as well as the 
position and values at its extremum) allow us to dis- 
ambiguate the considered reaction schemes. Performing 
detailed fits of the curves to experimental data (when fea- 
sible) can be an ultimate test for whether the underlying 
kinetic structure is known. 

For the MM reaction, knowing just a handful of fea- 
tures of the F(S) curve allows us to derive all three rates 
that completely define the kinetic scheme, while the en- 
tire dose-response curve is insufficient for this purpose. 
Similar results hold for the reactions with intermediate 
steps, but here the analytical treatment is more difficult, 
and often qualitative properties of F alone do not de- 
fine all of the underlying kinetic parameters. Instead, a 
quantitative fit of derived expressions for F(S) to exper- 
imental data would be required. 

We stress that the kinetic schemes analyzed in this 
article arc simple toy models only. However, extending 
our analysis to more complicated schemes to derive the 
first few cumulants of the product number distribution 
is not difficult, and it can be automated with just a sim- 
ple linear-algebra solver. In particular, calculation of the 
Fano factor for a signaling cascade as in [15] or for a 
complex network of single protein confirmations |17j is 
straightforward. It should be noted, however, that gener- 
ating sufficient experimental data to distinguish minute 
details of competing kinetic schemes is not easy. Our 
approach simplifies the problem somewhat since it does 
not require single-molecule kinetic data, as in [2j [17] , but 
it is based on measuring a mesoscopic, fluctuating flux. 
Still, ideally qualitative differences would dominate the 
disambiguation task, as emphasized with the toy models 
considered here. 



Acknowledgments 

The authors would like to thank the organizers, the 
lecturers, the participants, and the sponsors of the 2 nd 
q-bio Summer School on Cellular Information Processing 
in Los Alamos, NM. We are also thankful to Michael Wall 
for careful reading of the manuscript. WdR was sup- 
ported by the research program of the "Stichting voor 
Fundamenteel Onderzoek der Materie," which is finan- 
cially supported by The Netherlands Organization for 
Scientific Research. BCD was supported by NSF Grant 
DMR-0705167. AM was supported by NSF Grant DGE- 
0742450. NAS and IN were supported by DOE under 
Contract No. DE-AC52-06NA25396. IN was further sup- 
ported by NSF Grant No. ECS-0425850. 



t Electronic address: deronde@amolf.nl 
* Electronic address: bcd27@cornell.edu 
§ Electronic address: aim2121@columbia.edu 
■ Electronic address: nsinitsyn@lanl.gov 
** Electronic address: nemenman@lanl.gov 
[1] L. Michaelis and M. Menten, Biochem. Z. 49, 333 (1913). 
[2] B. P. English, W. Min, A. M. V. Oijen, K. T. Lee, G. Luo, 
H. Sun, B. J. Cherayil, S. C. Kou, and X. S. Xie, Nat. 
Chem. Biol. 2, 87 (2006). 
[3] T. K. Rostovtseva and S. M. Bezrukov, Biophys. J. 74, 
2365 (1998). 

[4] E. M. Nestorovich, C. Danelon, M. Winterhalter, and 
S. M. Bezrukov, Proc. Natl. Acad. Sci. (USA) 99, 9789 
(2002). 

[5] S. M. Bezrukov, A. M. Berezhkovskii, and A. Szabo, J. 

Chem. Phys. 127, 115101 (2007). 
[6] I. Golding, J. Paulsson, S. M. Zawilski, and E. C. Cox, 

Cell 123, 1025 (2005). 
[7] N. G. van Kampen, Stochastic processes in physics and 

chemistry (Amsterdam: North-Holland, 1992). 
[8] N. A. Sinitsyn and I. Nemenman, EPL 77, 58001 (2007). 
[9] N. A. Sinitsyn and I. Nemenman, Phys. Rev. Lett. 99, 

220408 (2007). 

[10] D. A. Bagrets and Y. V. Nazarov, Phys. Rev. B 67, 
085316 (2003). 

[11] I. V. Gopich and A. Szabo, J. Chem. Phys. 124, 4712 
(2006). 

[12] J. E. Hornos, D. Schultz, G. C. Innocentini, J. Wang, 
A. M. Walczak, J. N. Onuchic, and P. G. Wolynes, Phys. 
Rev. E 72, 51907 (2005). 

[13] D. J. Griffiths, Introduction to Quantum Mechanics 
(Prentice Hall, 1994). 

[14] C. Frenzen and P. Maini, J. Math. Biol. 26, 689 (1988). 

[15] T. Doan, A. Mendez, P. B. Detwiler, J. Chen, and 
F. Rieke, Science 313, 530 (2006). 

[16] A. Hodgkin and A. Huxley, J. Physiol. 117, 500 (1952). 

[17] C.-B. Li, H. Yang, and T. Komatsuzaki, Proc. Natl. 
Acad. Sci. (USA) 105, 536 (2008). 

[18] Note that since H is not symmetric, the eigenvectors do 
not satisfy \uj) = (mj| t , but rather they solve H\uj) = 
\j\uj) and {uj\H — \j{v,j\, respectively. 

[19] More precisely, Ho is a propensity matrix whose columns 
sum to zero, which means one of its eigenvalues is zero 
and the rest are negative [7]- 

[20] In all schemes A, B, C, and D, F* is dependent on kj 
through S* , which explicitly ensures that kiS = fe; this 
is a commonly known result, and it may be used by na- 
ture to suppress noise in natural signaling systems such 
as phototransduction [15] . 

[21] We leave this as an exercise for future q-bio Summer 
School students. 



